Exploiting Keyword Structure for Domain-Specific Retrieval

Author

  • Jaap Kamps

Abstract

Structured elements, such as manually assigned keywords or key-phrases in scientific collections, are pervasive in digital libraries. Special dictionaries or thesauri for this meta-information are not always available. Our strategy is to compute the similarity of keywords based on their occurrence in the collection. The resulting keyword space is brought to bear on a variety of tasks. Combined with an information retrieval system, it lets us recover keywords for queries, and thus provides a technique that can be used for automatic classification. Moreover, it can be used to rerank retrieved documents, leading to a significant improvement of retrieval effectiveness in domain-specific collections. Experimental evaluation is done on the German GIRT and French Amaryllis collections, using the test-suite of the Cross-Language Evaluation Forum (CLEF).
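As a rough illustration of computing keyword similarity from occurrence in a collection, the sketch below scores each keyword pair by the overlap of the documents they are assigned to. The abstract does not state which similarity measure is used; the Jaccard coefficient here, the function name `keyword_similarity`, and the toy documents are all assumptions chosen for illustration, not the author's method.

```python
from itertools import combinations


def keyword_similarity(doc_keywords):
    """Jaccard similarity of keyword pairs, based on the documents they share.

    Illustrative only: Jaccard is an assumed measure, not taken from the paper.
    """
    # Map each keyword to the set of document ids it is assigned to.
    postings = {}
    for doc_id, keywords in doc_keywords.items():
        for kw in set(keywords):
            postings.setdefault(kw, set()).add(doc_id)

    # Score only pairs that co-occur in at least one document.
    sims = {}
    for a, b in combinations(sorted(postings), 2):
        shared = len(postings[a] & postings[b])
        if shared:
            sims[(a, b)] = shared / len(postings[a] | postings[b])
    return sims


# Hypothetical toy collection: document id -> manually assigned keywords.
docs = {
    "d1": ["unemployment", "labour market"],
    "d2": ["unemployment", "labour market", "youth"],
    "d3": ["youth", "education"],
}
sims = keyword_similarity(docs)
```

Keywords that are always assigned together score 1.0, and pairs that never co-occur are simply absent from the result. A similarity table of this kind is one possible basis for the keyword space the abstract refers to.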


Similar resources

Improving relevance in search through Ontology and Query Expansion

Anirudh Vemula (Bachelor of Technology; Prof. Pushpak Bhattacharyya, Computer Science and Engineering department). From the inception of Semantic Web in the late 20th century, ontology has been a major focus to achieve the idea of semantic search. In this work, we will review different approaches that have been employed over the y...


Semiautomatic Image Retrieval Using the High Level Semantic Labels

Content-based image retrieval and text-based image retrieval are two fundamental approaches in the field of image retrieval. The challenges of each approach have led researchers to combine them and to use semi-automatic retrieval with user interaction in the retrieval cycle. Hence, in this paper, an image retrieval system is introduced that provides two kinds of qu...


Document Image Retrieval Based on Keyword Spotting Using Relevance Feedback

Keyword spotting is a well-known method in document image retrieval. In this method, search in document images is based on a query word image. In this paper, an approach for document image retrieval based on keyword spotting is proposed. In the proposed method, a framework using relevance feedback is presented. Relevance feedback, an interactive and efficient method, is used in this paper to imp...


Exploiting Domain Thesaurus for Medical Record Retrieval

InfoLab at the University of Delaware participated in the TREC 2012 Medical Records Track. This paper explains our method and describes experimental results. One limitation of existing retrieval functions based on keyword matching is the problem of vocabulary mismatch. To overcome this limitation, we propose to first map topics and visits to bags of concepts using a domain thesaurus, and then model th...


Expanding Queries Using Stems and Symbols

This paper describes the experiments conducted in the ad-hoc retrieval task of the Genomic track at TREC 2004. Different query expansion techniques based on the addition of keyword stems and of genomic product symbols selected by relevance feedback were studied. Stemming was tested using a mutual reinforcement process for building a domain-specific stemmer. Relevance feedback was tested using a...



Publication date: 2002